Visual Semantics: Extracting Visual information from Text Accompanying Pictures

نویسندگان

  • Rohini K. Srihari
  • Debra T. Burhans
چکیده

This research explores the interaction of textual and photographic information in document understanding. The problem of performing generalpurpose vision without a priori knowledge is di cult at best. The use of collateral information in scene understanding has been explored in computer vision systems that use scene context in the task of object identi cation. The work described here extends this notion by de ning visual semantics, a theory of systematically extracting picture-speci c information from text accompanying a photograph. Speci cally, this paper discusses the multi-stage processing of textual captions with the following objectives: (i) predicting which objects (implicitly or explicitly mentioned in the caption) are present in the picture and (ii) generating constraints useful in locating/identifying these objects. The implementation and use of a lexicon speci cally designed for the integration of linguistic and visual information is discussed. Finally, the research described here has been successfully incorporated into PICTION, a caption-based face identi cation system.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Visual Semantics for Reducing False Positives in Video Search

This research explores the interaction of textual and visual information in video indexing and searching. Much of the recent work has focused on machine learning techniques that learn from both text and image/video features, e.g. the text surrounding a photograph on a web page. This is useful in similarity search (i.e. searching by example), but has drawbacks when more semantic search is desire...

متن کامل

Comparative Approach to the Relationship Between Text and Hand Visual Language in Tahmasebi’s Shahnameh Pictures

The painters of Tahmasbi Shahnameh, in order to depict the text full of the story of Shahnameh, tried to convey emotions and excitement to the audience by using the visual language of the hand. Due to the multiplicity of applications of this type of nonverbal communication in different situations, the painter may have undergone changes in parts of her painting under the influence of various fac...

متن کامل

Learning the Semantics of Words and Pictures

We present a statistical model for organizing image collections which integrates semantic information provided by associated text and visual information provided by image features. The model is very promising for information retrieval tasks such as database browsing and searching for images based on text and/or image features. Furthermore, since the model learns relationships between text and i...

متن کامل

Interrogation of a University Classrooms in the Court of Semantics: Managerial Implications

The purpose of this article, within the framework of an interpretive study, was to study the semantics of a universitychr('39')s classrooms to create a critical awareness of the meanings of the symptoms and their functions at the context of physical artifacts, besides their managerial implications. To accomplish this goal, after taking pictures of the structural elements of the studied classroo...

متن کامل

Using Eye Movement Analysis to Study Auditory Effects on Visual Memory Recall

Recent studies in affective computing are focused on sensing human cognitive context using biosignals. In this study, electrooculography (EOG) was utilized to investigate memory recall accessibility via eye movement patterns. 12 subjects were participated in our experiment wherein pictures from four categories were presented. Each category contained nine pictures of which three were presented t...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1994